Identifying Topic-Related Hyperlinks on Twitter
نویسندگان
چکیده
The microblogging service Twitter has become one of the most popular sources of real time information. Every second, hundreds of URLs are posted on Twitter. Due to the maximum tweet length of 140 characters, these URLs are in most cases a shortened version of the original URLs. In contrast to the original URLS, which usually provide some hints on the destination Web site and the specific page, shortened links do not tell the users what to expect behind them. These links might contain relevant information or news regarding a certain topic of interest, but they might just as well be completely irrelevant, or even lead to a malicious or harmful website. In this paper, we present our work towards identifying credible Twitter users for given topics. We achieve this by characterizing the content of the posted URLs to further relate to the expertise of Twitter users.
منابع مشابه
Beyond Twitter Text: A Preliminary Study on Twitter Hyperlink and its Application
While the popularity of Twitter brings a plethora of Twitter researches, short, plain and informal tweet texts limit the research progress. This paper aims to investigate whether hyperlinks in tweets and their linked pages can be used to discover rich information for Twitter applications. The statistical analysis on the analysed hyperlinks offers the evidence that tweets contain a large amount ...
متن کاملOn-line Trend Analysis with Topic Models: \#twitter Trends Detection Topic Model Online
We present a novel topic modelling-based methodology to track emerging events in microblogs such as Twitter. Our topic model has an in-built update mechanism based on time slices and implements a dynamic vocabulary. We first show that the method is robust in detecting events using a range of datasets with injected novel events, and then demonstrate its application in identifying trending topics...
متن کاملDetection of illicit online sales of fentanyls via Twitter
A counterfeit fentanyl crisis is currently underway in the United States. Counterfeit versions of commonly abused prescription drugs laced with fentanyl are being manufactured, distributed, and sold globally, leading to an increase in overdose and death in countries like the United States and Canada. Despite concerns from the U.S. Drug Enforcement Agency regarding covert and overt sale of fen...
متن کاملAutomatic Humor Classification on Twitter
Much has been written about humor and even sarcasm automatic recognition on Twitter. The task of classifying humorous tweets according to the type of humor has not been confronted so far, as far as we know. This research is aimed at applying classification and other NLP algorithms to the challenging task of automatically identifying the type and topic of humorous messages on Twitter. To achieve...
متن کاملIdentifying Health-Related Topics on Twitter - An Exploration of Tobacco-Related Tweets as a Test Topic
Public health-related topics are difficult to identify in large conversational datasets like Twitter. This study examines how to model and discover public health topics and themes in tweets. Tobacco use is chosen as a test case to demonstrate the effectiveness of topic modeling via LDA across a large, representational dataset from the United States, as well as across a smaller subset that was s...
متن کامل